Overview

Dataset statistics

Number of variables26
Number of observations300000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory59.5 MiB
Average record size in memory208.0 B

Variable types

Numeric16
Categorical10

Warnings

cat5 is highly correlated with cont12High correlation
cont12 is highly correlated with cat5High correlation
id is uniformly distributed Uniform
id has unique values Unique

Reproduction

Analysis started2021-08-28 03:03:28.413191
Analysis finished2021-08-28 03:05:31.759516
Duration2 minutes and 3.35 seconds
Software versionpandas-profiling v3.0.0
Download configurationconfig.json

Variables

id
Real number (ℝ≥0)

UNIFORM
UNIQUE

Distinct300000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean250018.5769
Minimum1
Maximum499999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum1
5-th percentile24987.95
Q1124772.5
median250002.5
Q3375226.5
95-th percentile475032.1
Maximum499999
Range499998
Interquartile range (IQR)250454

Descriptive statistics

Standard deviation144450.15
Coefficient of variation (CV)0.5777576681
Kurtosis-1.202249332
Mean250018.5769
Median Absolute Deviation (MAD)125228
Skewness0.0001529463078
Sum7.500557308 × 1010
Variance2.086584584 × 1010
MonotonicityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20471
 
< 0.1%
5971
 
< 0.1%
2217531
 
< 0.1%
2278981
 
< 0.1%
2156121
 
< 0.1%
2197101
 
< 0.1%
2176631
 
< 0.1%
476821
 
< 0.1%
456351
 
< 0.1%
353961
 
< 0.1%
Other values (299990)299990
> 99.9%
ValueCountFrequency (%)
11
< 0.1%
21
< 0.1%
31
< 0.1%
41
< 0.1%
61
< 0.1%
71
< 0.1%
81
< 0.1%
91
< 0.1%
101
< 0.1%
111
< 0.1%
ValueCountFrequency (%)
4999991
< 0.1%
4999981
< 0.1%
4999971
< 0.1%
4999961
< 0.1%
4999931
< 0.1%
4999921
< 0.1%
4999891
< 0.1%
4999881
< 0.1%
4999851
< 0.1%
4999801
< 0.1%

cat0
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
A
193130 
B
106870 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowB
2nd rowB
3rd rowA
4th rowB
5th rowA

Common Values

ValueCountFrequency (%)
A193130
64.4%
B106870
35.6%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
a193130
64.4%
b106870
35.6%

Most occurring characters

ValueCountFrequency (%)
A193130
64.4%
B106870
35.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A193130
64.4%
B106870
35.6%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A193130
64.4%
B106870
35.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A193130
64.4%
B106870
35.6%

cat1
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
A
154824 
B
145176 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowB
2nd rowB
3rd rowA
4th rowB
5th rowA

Common Values

ValueCountFrequency (%)
A154824
51.6%
B145176
48.4%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
a154824
51.6%
b145176
48.4%

Most occurring characters

ValueCountFrequency (%)
A154824
51.6%
B145176
48.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A154824
51.6%
B145176
48.4%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A154824
51.6%
B145176
48.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A154824
51.6%
B145176
48.4%

cat2
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
A
253886 
B
46114 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowB
2nd rowA
3rd rowA
4th rowA
5th rowA

Common Values

ValueCountFrequency (%)
A253886
84.6%
B46114
 
15.4%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
a253886
84.6%
b46114
 
15.4%

Most occurring characters

ValueCountFrequency (%)
A253886
84.6%
B46114
 
15.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A253886
84.6%
B46114
 
15.4%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A253886
84.6%
B46114
 
15.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A253886
84.6%
B46114
 
15.4%

cat3
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
C
263356 
A
31726 
D
 
4328
B
 
590

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowC
2nd rowA
3rd rowC
4th rowC
5th rowC

Common Values

ValueCountFrequency (%)
C263356
87.8%
A31726
 
10.6%
D4328
 
1.4%
B590
 
0.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
c263356
87.8%
a31726
 
10.6%
d4328
 
1.4%
b590
 
0.2%

Most occurring characters

ValueCountFrequency (%)
C263356
87.8%
A31726
 
10.6%
D4328
 
1.4%
B590
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C263356
87.8%
A31726
 
10.6%
D4328
 
1.4%
B590
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
C263356
87.8%
A31726
 
10.6%
D4328
 
1.4%
B590
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C263356
87.8%
A31726
 
10.6%
D4328
 
1.4%
B590
 
0.2%

cat4
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
B
294737 
A
 
2978
C
 
1772
D
 
513

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowB
2nd rowB
3rd rowB
4th rowB
5th rowB

Common Values

ValueCountFrequency (%)
B294737
98.2%
A2978
 
1.0%
C1772
 
0.6%
D513
 
0.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
b294737
98.2%
a2978
 
1.0%
c1772
 
0.6%
d513
 
0.2%

Most occurring characters

ValueCountFrequency (%)
B294737
98.2%
A2978
 
1.0%
C1772
 
0.6%
D513
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B294737
98.2%
A2978
 
1.0%
C1772
 
0.6%
D513
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
B294737
98.2%
A2978
 
1.0%
C1772
 
0.6%
D513
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
B294737
98.2%
A2978
 
1.0%
C1772
 
0.6%
D513
 
0.2%

cat5
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
B
149340 
D
126137 
C
20248 
A
 
4275

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowB
2nd rowD
3rd rowD
4th rowD
5th rowD

Common Values

ValueCountFrequency (%)
B149340
49.8%
D126137
42.0%
C20248
 
6.7%
A4275
 
1.4%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
b149340
49.8%
d126137
42.0%
c20248
 
6.7%
a4275
 
1.4%

Most occurring characters

ValueCountFrequency (%)
B149340
49.8%
D126137
42.0%
C20248
 
6.7%
A4275
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B149340
49.8%
D126137
42.0%
C20248
 
6.7%
A4275
 
1.4%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
B149340
49.8%
D126137
42.0%
C20248
 
6.7%
A4275
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
B149340
49.8%
D126137
42.0%
C20248
 
6.7%
A4275
 
1.4%

cat6
Categorical

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
A
290511 
B
 
8018
C
 
928
D
 
292
I
 
136
Other values (3)
 
115

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA
2nd rowA
3rd rowA
4th rowA
5th rowA

Common Values

ValueCountFrequency (%)
A290511
96.8%
B8018
 
2.7%
C928
 
0.3%
D292
 
0.1%
I136
 
< 0.1%
H56
 
< 0.1%
E45
 
< 0.1%
G14
 
< 0.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
a290511
96.8%
b8018
 
2.7%
c928
 
0.3%
d292
 
0.1%
i136
 
< 0.1%
h56
 
< 0.1%
e45
 
< 0.1%
g14
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
A290511
96.8%
B8018
 
2.7%
C928
 
0.3%
D292
 
0.1%
I136
 
< 0.1%
H56
 
< 0.1%
E45
 
< 0.1%
G14
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A290511
96.8%
B8018
 
2.7%
C928
 
0.3%
D292
 
0.1%
I136
 
< 0.1%
H56
 
< 0.1%
E45
 
< 0.1%
G14
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A290511
96.8%
B8018
 
2.7%
C928
 
0.3%
D292
 
0.1%
I136
 
< 0.1%
H56
 
< 0.1%
E45
 
< 0.1%
G14
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A290511
96.8%
B8018
 
2.7%
C928
 
0.3%
D292
 
0.1%
I136
 
< 0.1%
H56
 
< 0.1%
E45
 
< 0.1%
G14
 
< 0.1%

cat7
Categorical

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
E
276040 
D
 
12144
B
 
8297
G
 
2870
F
 
562
Other values (3)
 
87

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters8
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowE
2nd rowF
3rd rowD
4th rowE
5th rowE

Common Values

ValueCountFrequency (%)
E276040
92.0%
D12144
 
4.0%
B8297
 
2.8%
G2870
 
1.0%
F562
 
0.2%
C36
 
< 0.1%
A31
 
< 0.1%
I20
 
< 0.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
e276040
92.0%
d12144
 
4.0%
b8297
 
2.8%
g2870
 
1.0%
f562
 
0.2%
c36
 
< 0.1%
a31
 
< 0.1%
i20
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
E276040
92.0%
D12144
 
4.0%
B8297
 
2.8%
G2870
 
1.0%
F562
 
0.2%
C36
 
< 0.1%
A31
 
< 0.1%
I20
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E276040
92.0%
D12144
 
4.0%
B8297
 
2.8%
G2870
 
1.0%
F562
 
0.2%
C36
 
< 0.1%
A31
 
< 0.1%
I20
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
E276040
92.0%
D12144
 
4.0%
B8297
 
2.8%
G2870
 
1.0%
F562
 
0.2%
C36
 
< 0.1%
A31
 
< 0.1%
I20
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E276040
92.0%
D12144
 
4.0%
B8297
 
2.8%
G2870
 
1.0%
F562
 
0.2%
C36
 
< 0.1%
A31
 
< 0.1%
I20
 
< 0.1%

cat8
Categorical

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
C
111103 
E
79844 
A
76585 
G
26128 
D
 
5187
Other values (2)
 
1153

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters7
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowC
2nd rowA
3rd rowA
4th rowC
5th rowA

Common Values

ValueCountFrequency (%)
C111103
37.0%
E79844
26.6%
A76585
25.5%
G26128
 
8.7%
D5187
 
1.7%
F966
 
0.3%
B187
 
0.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
c111103
37.0%
e79844
26.6%
a76585
25.5%
g26128
 
8.7%
d5187
 
1.7%
f966
 
0.3%
b187
 
0.1%

Most occurring characters

ValueCountFrequency (%)
C111103
37.0%
E79844
26.6%
A76585
25.5%
G26128
 
8.7%
D5187
 
1.7%
F966
 
0.3%
B187
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C111103
37.0%
E79844
26.6%
A76585
25.5%
G26128
 
8.7%
D5187
 
1.7%
F966
 
0.3%
B187
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
C111103
37.0%
E79844
26.6%
A76585
25.5%
G26128
 
8.7%
D5187
 
1.7%
F966
 
0.3%
B187
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C111103
37.0%
E79844
26.6%
A76585
25.5%
G26128
 
8.7%
D5187
 
1.7%
F966
 
0.3%
B187
 
0.1%

cat9
Categorical

Distinct15
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.3 MiB
F
71249 
I
59218 
G
28253 
L
20958 
H
19925 
Other values (10)
100397 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters300000
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowO
3rd rowF
4th rowK
5th rowN

Common Values

ValueCountFrequency (%)
F71249
23.7%
I59218
19.7%
G28253
 
9.4%
L20958
 
7.0%
H19925
 
6.6%
K18057
 
6.0%
N16704
 
5.6%
B14477
 
4.8%
J14266
 
4.8%
O14203
 
4.7%
Other values (5)22690
 
7.6%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
f71249
23.7%
i59218
19.7%
g28253
 
9.4%
l20958
 
7.0%
h19925
 
6.6%
k18057
 
6.0%
n16704
 
5.6%
b14477
 
4.8%
j14266
 
4.8%
o14203
 
4.7%
Other values (5)22690
 
7.6%

Most occurring characters

ValueCountFrequency (%)
F71249
23.7%
I59218
19.7%
G28253
 
9.4%
L20958
 
7.0%
H19925
 
6.6%
K18057
 
6.0%
N16704
 
5.6%
B14477
 
4.8%
J14266
 
4.8%
O14203
 
4.7%
Other values (5)22690
 
7.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter300000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
F71249
23.7%
I59218
19.7%
G28253
 
9.4%
L20958
 
7.0%
H19925
 
6.6%
K18057
 
6.0%
N16704
 
5.6%
B14477
 
4.8%
J14266
 
4.8%
O14203
 
4.7%
Other values (5)22690
 
7.6%

Most occurring scripts

ValueCountFrequency (%)
Latin300000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
F71249
23.7%
I59218
19.7%
G28253
 
9.4%
L20958
 
7.0%
H19925
 
6.6%
K18057
 
6.0%
N16704
 
5.6%
B14477
 
4.8%
J14266
 
4.8%
O14203
 
4.7%
Other values (5)22690
 
7.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII300000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
F71249
23.7%
I59218
19.7%
G28253
 
9.4%
L20958
 
7.0%
H19925
 
6.6%
K18057
 
6.0%
N16704
 
5.6%
B14477
 
4.8%
J14266
 
4.8%
O14203
 
4.7%
Other values (5)22690
 
7.6%

cont0
Real number (ℝ)

Distinct299632
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5273354349
Minimum-0.1180394752
Maximum1.058443368
Zeros0
Zeros (%)0.0%
Negative4249
Negative (%)1.4%
Memory size2.3 MiB

Quantile statistics

Minimum-0.1180394752
5-th percentile0.1701039938
Q10.4059654463
median0.4970530352
Q30.6680601291
95-th percentile0.9901239973
Maximum1.058443368
Range1.176482844
Interquartile range (IQR)0.2620946829

Descriptive statistics

Standard deviation0.2305992958
Coefficient of variation (CV)0.4372914857
Kurtosis0.1416191427
Mean0.5273354349
Median Absolute Deviation (MAD)0.09741693483
Skewness0.2363602942
Sum158200.6305
Variance0.05317603522
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.99967693072
 
< 0.1%
0.39963911452
 
< 0.1%
0.40463740612
 
< 0.1%
0.800014482
 
< 0.1%
0.4175351342
 
< 0.1%
0.41478971622
 
< 0.1%
0.40416652432
 
< 0.1%
0.81712657992
 
< 0.1%
0.6446659752
 
< 0.1%
0.45908532722
 
< 0.1%
Other values (299622)299980
> 99.9%
ValueCountFrequency (%)
-0.11803947521
< 0.1%
-0.11802624851
< 0.1%
-0.11801676581
< 0.1%
-0.1179992351
< 0.1%
-0.11798835271
< 0.1%
-0.11796821511
< 0.1%
-0.11795841
< 0.1%
-0.11794905731
< 0.1%
-0.11794666041
< 0.1%
-0.11794323121
< 0.1%
ValueCountFrequency (%)
1.0584433681
< 0.1%
1.0584411031
< 0.1%
1.0584370671
< 0.1%
1.0584253931
< 0.1%
1.0584226631
< 0.1%
1.0583961211
< 0.1%
1.0583862051
< 0.1%
1.0583295921
< 0.1%
1.058327691
< 0.1%
1.0583083941
< 0.1%

cont1
Real number (ℝ)

Distinct299727
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4609256965
Minimum-0.06930914091
Maximum0.8872528309
Zeros0
Zeros (%)0.0%
Negative8826
Negative (%)2.9%
Memory size2.3 MiB

Quantile statistics

Minimum-0.06930914091
5-th percentile0.08695694432
Q10.3104944093
median0.4279026358
Q30.6151127901
95-th percentile0.8368435233
Maximum0.8872528309
Range0.9565619719
Interquartile range (IQR)0.3046183808

Descriptive statistics

Standard deviation0.2140026067
Coefficient of variation (CV)0.464288731
Kurtosis-0.2311100701
Mean0.4609256965
Median Absolute Deviation (MAD)0.1323845775
Skewness0.03267732481
Sum138277.709
Variance0.04579711569
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.4959185652
 
< 0.1%
0.31061724342
 
< 0.1%
0.49094613982
 
< 0.1%
0.56288275742
 
< 0.1%
0.56282537092
 
< 0.1%
0.42964508662
 
< 0.1%
0.35856536492
 
< 0.1%
0.56277099912
 
< 0.1%
0.35480267042
 
< 0.1%
0.82528636812
 
< 0.1%
Other values (299717)299980
> 99.9%
ValueCountFrequency (%)
-0.069309140911
< 0.1%
-0.069307837661
< 0.1%
-0.06930133421
< 0.1%
-0.069295456811
< 0.1%
-0.069287995071
< 0.1%
-0.069278233491
< 0.1%
-0.069255324431
< 0.1%
-0.069243441881
< 0.1%
-0.069241512561
< 0.1%
-0.069238816621
< 0.1%
ValueCountFrequency (%)
0.88725283091
< 0.1%
0.88725255181
< 0.1%
0.88725181971
< 0.1%
0.88725175121
< 0.1%
0.88725102971
< 0.1%
0.88724969191
< 0.1%
0.88724851741
< 0.1%
0.88724786431
< 0.1%
0.88724780111
< 0.1%
0.88724777481
< 0.1%

cont2
Real number (ℝ)

Distinct299738
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4904983051
Minimum-0.05610359404
Maximum1.034704396
Zeros0
Zeros (%)0.0%
Negative11207
Negative (%)3.7%
Memory size2.3 MiB

Quantile statistics

Minimum-0.05610359404
5-th percentile0.04090184979
Q10.3006038001
median0.5024622259
Q30.647512145
95-th percentile0.9635879223
Maximum1.034704396
Range1.09080799
Interquartile range (IQR)0.3469083449

Descriptive statistics

Standard deviation0.2533457756
Coefficient of variation (CV)0.5165069337
Kurtosis-0.3884577917
Mean0.4904983051
Median Absolute Deviation (MAD)0.1725284307
Skewness0.08286588088
Sum147149.4915
Variance0.06418408199
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.53932114112
 
< 0.1%
0.29515685732
 
< 0.1%
-0.02452509342
 
< 0.1%
0.40387386882
 
< 0.1%
0.59156452022
 
< 0.1%
0.60907905312
 
< 0.1%
0.62090606422
 
< 0.1%
0.59480908232
 
< 0.1%
0.5753932412
 
< 0.1%
0.60180838992
 
< 0.1%
Other values (299728)299980
> 99.9%
ValueCountFrequency (%)
-0.056103594041
< 0.1%
-0.056097101791
< 0.1%
-0.056097020471
< 0.1%
-0.056089376151
< 0.1%
-0.056087153341
< 0.1%
-0.056084944081
< 0.1%
-0.056084591681
< 0.1%
-0.056077679271
< 0.1%
-0.056077231991
< 0.1%
-0.056073938431
< 0.1%
ValueCountFrequency (%)
1.0347043961
< 0.1%
1.0347008091
< 0.1%
1.0346997051
< 0.1%
1.0346907941
< 0.1%
1.0346907131
< 0.1%
1.0346864931
< 0.1%
1.0346811541
< 0.1%
1.0346782651
< 0.1%
1.0346774691
< 0.1%
1.0346732821
< 0.1%

cont3
Real number (ℝ≥0)

Distinct299407
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4966894097
Minimum0.1306755236
Maximum1.039560476
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum0.1306755236
5-th percentile0.1672237103
Q10.3297826578
median0.4650264926
Q30.664450523
95-th percentile0.9163969387
Maximum1.039560476
Range0.9088849524
Interquartile range (IQR)0.3346678652

Descriptive statistics

Standard deviation0.2191989149
Coefficient of variation (CV)0.4413198884
Kurtosis-0.6208709224
Mean0.4966894097
Median Absolute Deviation (MAD)0.1561897046
Skewness0.4027950923
Sum149006.8229
Variance0.04804816428
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.47207569653
 
< 0.1%
0.28721097683
 
< 0.1%
0.47323522262
 
< 0.1%
0.15476344252
 
< 0.1%
0.68709901032
 
< 0.1%
0.45481778972
 
< 0.1%
0.4738946872
 
< 0.1%
0.56607970422
 
< 0.1%
0.47157614562
 
< 0.1%
0.471725262
 
< 0.1%
Other values (299397)299978
> 99.9%
ValueCountFrequency (%)
0.13067552361
< 0.1%
0.13068702121
< 0.1%
0.13069534791
< 0.1%
0.13070289931
< 0.1%
0.13071237951
< 0.1%
0.13071654521
< 0.1%
0.13071837111
< 0.1%
0.13072716951
< 0.1%
0.13072725821
< 0.1%
0.13073950311
< 0.1%
ValueCountFrequency (%)
1.0395604761
< 0.1%
1.0395573981
< 0.1%
1.0395573241
< 0.1%
1.0395542631
< 0.1%
1.0395532341
< 0.1%
1.0395487961
< 0.1%
1.0395482731
< 0.1%
1.0395479081
< 0.1%
1.0395427651
< 0.1%
1.0395412881
< 0.1%

cont4
Real number (ℝ≥0)

Distinct299702
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4916542362
Minimum0.2559077525
Maximum1.055424053
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum0.2559077525
5-th percentile0.2748932529
Q10.2841879485
median0.3904702598
Q30.6965992161
95-th percentile0.9683590603
Maximum1.055424053
Range0.7995163009
Interquartile range (IQR)0.4124112676

Descriptive statistics

Standard deviation0.2400743146
Coefficient of variation (CV)0.4882990868
Kurtosis-0.7672348416
Mean0.4916542362
Median Absolute Deviation (MAD)0.1141287444
Skewness0.7765983065
Sum147496.2709
Variance0.05763567651
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.49403112972
 
< 0.1%
0.28760404472
 
< 0.1%
0.28689839822
 
< 0.1%
0.28754685972
 
< 0.1%
0.27470289842
 
< 0.1%
0.27679022992
 
< 0.1%
0.28819771992
 
< 0.1%
0.27289596642
 
< 0.1%
0.67923753292
 
< 0.1%
0.28757375332
 
< 0.1%
Other values (299692)299980
> 99.9%
ValueCountFrequency (%)
0.25590775251
< 0.1%
0.25591495841
< 0.1%
0.25591719721
< 0.1%
0.25592271151
< 0.1%
0.25592352411
< 0.1%
0.25593003341
< 0.1%
0.25593941181
< 0.1%
0.25594378171
< 0.1%
0.2559527041
< 0.1%
0.25595568921
< 0.1%
ValueCountFrequency (%)
1.0554240531
< 0.1%
1.0553737051
< 0.1%
1.0553578941
< 0.1%
1.055350931
< 0.1%
1.0553430181
< 0.1%
1.0553249321
< 0.1%
1.0553205031
< 0.1%
1.0553144741
< 0.1%
1.0553106721
< 0.1%
1.0553086351
< 0.1%

cont5
Real number (ℝ≥0)

Distinct299760
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5105263071
Minimum0.04591511232
Maximum1.067649259
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum0.04591511232
5-th percentile0.112873713
Q10.3541407996
median0.4888650741
Q30.6696250385
95-th percentile0.9384704528
Maximum1.067649259
Range1.021734147
Interquartile range (IQR)0.3154842388

Descriptive statistics

Standard deviation0.2282316365
Coefficient of variation (CV)0.4470516666
Kurtosis-0.3014924822
Mean0.5105263071
Median Absolute Deviation (MAD)0.1495393184
Skewness0.2758216069
Sum153157.8921
Variance0.05208967988
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.39310101613
 
< 0.1%
0.5317665072
 
< 0.1%
0.431856332
 
< 0.1%
0.77968349412
 
< 0.1%
0.60410672962
 
< 0.1%
0.30737858162
 
< 0.1%
0.42267088552
 
< 0.1%
0.4448931822
 
< 0.1%
0.34016495192
 
< 0.1%
0.76639330612
 
< 0.1%
Other values (299750)299979
> 99.9%
ValueCountFrequency (%)
0.045915112321
< 0.1%
0.045915675491
< 0.1%
0.045917635361
< 0.1%
0.045918176011
< 0.1%
0.045920653991
< 0.1%
0.045923424831
< 0.1%
0.045928065431
< 0.1%
0.045929754961
< 0.1%
0.045930892581
< 0.1%
0.045932559591
< 0.1%
ValueCountFrequency (%)
1.0676492591
< 0.1%
1.0676483621
< 0.1%
1.0676469541
< 0.1%
1.0676453631
< 0.1%
1.0676429151
< 0.1%
1.0676426291
< 0.1%
1.0676380191
< 0.1%
1.0676372231
< 0.1%
1.06763711
< 0.1%
1.0676349891
< 0.1%

cont6
Real number (ℝ)

Distinct299737
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4674764173
Minimum-0.2246887528
Maximum1.111551902
Zeros0
Zeros (%)0.0%
Negative3850
Negative (%)1.3%
Memory size2.3 MiB

Quantile statistics

Minimum-0.2246887528
5-th percentile0.1659112647
Q10.3428731589
median0.4293828373
Q30.5733827886
95-th percentile0.8842508354
Maximum1.111551902
Range1.336240655
Interquartile range (IQR)0.2305096298

Descriptive statistics

Standard deviation0.2103314142
Coefficient of variation (CV)0.4499294647
Kurtosis0.975212939
Mean0.4674764173
Median Absolute Deviation (MAD)0.1068207974
Skewness0.5078238038
Sum140242.9252
Variance0.0442393038
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.52434205472
 
< 0.1%
0.35454587462
 
< 0.1%
0.27214914272
 
< 0.1%
0.43156419522
 
< 0.1%
0.37032900312
 
< 0.1%
0.50348513612
 
< 0.1%
0.60192984082
 
< 0.1%
0.61363958862
 
< 0.1%
0.57619598152
 
< 0.1%
0.53690875112
 
< 0.1%
Other values (299727)299980
> 99.9%
ValueCountFrequency (%)
-0.22468875281
< 0.1%
-0.22465401351
< 0.1%
-0.22463540721
< 0.1%
-0.22459001391
< 0.1%
-0.22455280141
< 0.1%
-0.22454952911
< 0.1%
-0.2245356791
< 0.1%
-0.22453267311
< 0.1%
-0.22450740811
< 0.1%
-0.22450523931
< 0.1%
ValueCountFrequency (%)
1.1115519021
< 0.1%
1.1115137881
< 0.1%
1.1114986751
< 0.1%
1.1114962881
< 0.1%
1.111496061
< 0.1%
1.111440031
< 0.1%
1.1114045741
< 0.1%
1.1114002351
< 0.1%
1.1113936341
< 0.1%
1.1113888281
< 0.1%

cont7
Real number (ℝ≥0)

Distinct299710
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.537119483
Minimum0.2037632342
Maximum1.032836733
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum0.2037632342
5-th percentile0.2449133022
Q10.3558246242
median0.5046607916
Q30.7034407719
95-th percentile0.8973416438
Maximum1.032836733
Range0.8290734992
Interquartile range (IQR)0.3476161477

Descriptive statistics

Standard deviation0.2181399503
Coefficient of variation (CV)0.4061292826
Kurtosis-1.005639094
Mean0.537119483
Median Absolute Deviation (MAD)0.1608973431
Skewness0.4288088454
Sum161135.8449
Variance0.04758503791
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.90188492172
 
< 0.1%
0.41641688272
 
< 0.1%
0.51080311012
 
< 0.1%
0.30799087672
 
< 0.1%
0.31579808232
 
< 0.1%
0.8980884352
 
< 0.1%
0.35348960042
 
< 0.1%
0.87515333082
 
< 0.1%
0.84569960092
 
< 0.1%
0.900021262
 
< 0.1%
Other values (299700)299980
> 99.9%
ValueCountFrequency (%)
0.20376323421
< 0.1%
0.20377058861
< 0.1%
0.20378422521
< 0.1%
0.20378993731
< 0.1%
0.20380083231
< 0.1%
0.20380114651
< 0.1%
0.2038094091
< 0.1%
0.20381081911
< 0.1%
0.20381123571
< 0.1%
0.20381996261
< 0.1%
ValueCountFrequency (%)
1.0328367331
< 0.1%
1.0328317231
< 0.1%
1.0328278641
< 0.1%
1.0328232511
< 0.1%
1.032816811
< 0.1%
1.0328000071
< 0.1%
1.0327938211
< 0.1%
1.0327875781
< 0.1%
1.0327841521
< 0.1%
1.0327823141
< 0.1%

cont8
Real number (ℝ)

Distinct299713
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4984556496
Minimum-0.2602749272
Maximum1.04022859
Zeros0
Zeros (%)0.0%
Negative3705
Negative (%)1.2%
Memory size2.3 MiB

Quantile statistics

Minimum-0.2602749272
5-th percentile0.2426984523
Q10.332485799
median0.4391506753
Q30.606055543
95-th percentile0.9995190733
Maximum1.04022859
Range1.300503517
Interquartile range (IQR)0.2735697441

Descriptive statistics

Standard deviation0.2399197149
Coefficient of variation (CV)0.4813261021
Kurtosis0.2292715337
Mean0.4984556496
Median Absolute Deviation (MAD)0.134182164
Skewness0.5400712176
Sum149536.6949
Variance0.05756146959
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.004048732
 
< 0.1%
0.26642112922
 
< 0.1%
0.26072950542
 
< 0.1%
0.26198242022
 
< 0.1%
0.28195122372
 
< 0.1%
0.25977159462
 
< 0.1%
0.38647702362
 
< 0.1%
0.31717275082
 
< 0.1%
0.27926129482
 
< 0.1%
0.38510354282
 
< 0.1%
Other values (299703)299980
> 99.9%
ValueCountFrequency (%)
-0.26027492721
< 0.1%
-0.2601387011
< 0.1%
-0.26013352321
< 0.1%
-0.2600908711
< 0.1%
-0.26008576761
< 0.1%
-0.26006375241
< 0.1%
-0.2600359261
< 0.1%
-0.26001592241
< 0.1%
-0.26000653521
< 0.1%
-0.25998377491
< 0.1%
ValueCountFrequency (%)
1.040228591
< 0.1%
1.0402282681
< 0.1%
1.0402238981
< 0.1%
1.0402220571
< 0.1%
1.0402103791
< 0.1%
1.0402078221
< 0.1%
1.0402034961
< 0.1%
1.0402032621
< 0.1%
1.0402003391
< 0.1%
1.0401976351
< 0.1%

cont9
Real number (ℝ≥0)

Distinct299684
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4748721486
Minimum0.1178962395
Maximum0.9829224853
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum0.1178962395
5-th percentile0.1356718113
Q10.3068740022
median0.4346199198
Q30.6143334163
95-th percentile0.8461961535
Maximum0.9829224853
Range0.8650262458
Interquartile range (IQR)0.3074594141

Descriptive statistics

Standard deviation0.2180074275
Coefficient of variation (CV)0.4590865734
Kurtosis-0.7907234503
Mean0.4748721486
Median Absolute Deviation (MAD)0.1447985231
Skewness0.3881285117
Sum142461.6446
Variance0.04752723845
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.55758723012
 
< 0.1%
0.38864942252
 
< 0.1%
0.26544686222
 
< 0.1%
0.12083554842
 
< 0.1%
0.32732832522
 
< 0.1%
0.80605570922
 
< 0.1%
0.40154456742
 
< 0.1%
0.27559134512
 
< 0.1%
0.42733676792
 
< 0.1%
0.16031311032
 
< 0.1%
Other values (299674)299980
> 99.9%
ValueCountFrequency (%)
0.11789623951
< 0.1%
0.11789629761
< 0.1%
0.11789651541
< 0.1%
0.11789694381
< 0.1%
0.11789745941
< 0.1%
0.11789771361
< 0.1%
0.11789781521
< 0.1%
0.11789828721
< 0.1%
0.11789850511
< 0.1%
0.11789867211
< 0.1%
ValueCountFrequency (%)
0.98292248531
< 0.1%
0.98290460051
< 0.1%
0.98288390071
< 0.1%
0.98288105191
< 0.1%
0.98287730971
< 0.1%
0.98286903321
< 0.1%
0.98286709471
< 0.1%
0.98286547651
< 0.1%
0.98286441451
< 0.1%
0.98284331011
< 0.1%

cont10
Real number (ℝ≥0)

Distinct299616
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4744917876
Minimum0.0487320286
Maximum1.055960462
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum0.0487320286
5-th percentile0.07900451793
Q10.2760172011
median0.459974962
Q30.6915793661
95-th percentile0.8925311325
Maximum1.055960462
Range1.007228434
Interquartile range (IQR)0.415562165

Descriptive statistics

Standard deviation0.2559494629
Coefficient of variation (CV)0.5394181092
Kurtosis-1.082524969
Mean0.4744917876
Median Absolute Deviation (MAD)0.2176403472
Skewness0.05618383762
Sum142347.5363
Variance0.06551012758
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.082438647362
 
< 0.1%
0.34490516112
 
< 0.1%
0.66956680282
 
< 0.1%
0.078124445742
 
< 0.1%
0.65046194922
 
< 0.1%
0.28645616992
 
< 0.1%
0.28596786122
 
< 0.1%
0.32283090032
 
< 0.1%
0.28837519752
 
< 0.1%
0.63235922632
 
< 0.1%
Other values (299606)299980
> 99.9%
ValueCountFrequency (%)
0.04873202861
< 0.1%
0.048732427461
< 0.1%
0.048733900151
< 0.1%
0.048733941061
< 0.1%
0.048734247871
< 0.1%
0.048736385331
< 0.1%
0.048736518281
< 0.1%
0.048737162581
< 0.1%
0.048737755751
< 0.1%
0.04873800121
< 0.1%
ValueCountFrequency (%)
1.0559604621
< 0.1%
1.0559401111
< 0.1%
1.055924161
< 0.1%
1.05591951
< 0.1%
1.0559089191
< 0.1%
1.0558991991
< 0.1%
1.0558947911
< 0.1%
1.0558805961
< 0.1%
1.0558737621
< 0.1%
1.0558510691
< 0.1%

cont11
Real number (ℝ≥0)

Distinct299727
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4732163195
Minimum0.05260750271
Maximum1.071443616
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum0.05260750271
5-th percentile0.1281474713
Q10.3081505637
median0.4338119282
Q30.6420565412
95-th percentile0.8669996126
Maximum1.071443616
Range1.018836113
Interquartile range (IQR)0.3339059775

Descriptive statistics

Standard deviation0.2220216257
Coefficient of variation (CV)0.4691757586
Kurtosis-0.5027497475
Mean0.4732163195
Median Absolute Deviation (MAD)0.1614613601
Skewness0.3187170744
Sum141964.8959
Variance0.04929360227
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.63061246452
 
< 0.1%
0.6270429982
 
< 0.1%
0.48846850512
 
< 0.1%
0.48251742372
 
< 0.1%
0.27159574142
 
< 0.1%
0.16868957682
 
< 0.1%
0.26754482742
 
< 0.1%
0.26893709552
 
< 0.1%
0.26969172112
 
< 0.1%
0.17888832762
 
< 0.1%
Other values (299717)299980
> 99.9%
ValueCountFrequency (%)
0.052607502711
< 0.1%
0.052611783231
< 0.1%
0.052612457991
< 0.1%
0.052615146491
< 0.1%
0.052619437561
< 0.1%
0.052620534041
< 0.1%
0.052623043311
< 0.1%
0.052623581011
< 0.1%
0.052626258971
< 0.1%
0.052627039171
< 0.1%
ValueCountFrequency (%)
1.0714436161
< 0.1%
1.0714350281
< 0.1%
1.0714274251
< 0.1%
1.0714112231
< 0.1%
1.0714029021
< 0.1%
1.0714011471
< 0.1%
1.0713906821
< 0.1%
1.0713863521
< 0.1%
1.0713788921
< 0.1%
1.0713415641
< 0.1%

cont12
Real number (ℝ)

HIGH CORRELATION

Distinct299657
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.494560612
Minimum-0.07420777541
Maximum0.9750354984
Zeros0
Zeros (%)0.0%
Negative5673
Negative (%)1.9%
Memory size2.3 MiB

Quantile statistics

Minimum-0.07420777541
5-th percentile0.1693536529
Q10.2890741519
median0.4228872583
Q30.7145018573
95-th percentile0.8883649585
Maximum0.9750354984
Range1.049243274
Interquartile range (IQR)0.4254277054

Descriptive statistics

Standard deviation0.2472916426
Coefficient of variation (CV)0.5000229226
Kurtosis-0.9911287576
Mean0.494560612
Median Absolute Deviation (MAD)0.1979372957
Skewness0.1011186488
Sum148368.1836
Variance0.06115315649
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.79177091592
 
< 0.1%
0.41516787952
 
< 0.1%
0.27313500432
 
< 0.1%
0.71021187432
 
< 0.1%
0.16849696912
 
< 0.1%
0.17525720692
 
< 0.1%
0.16806160152
 
< 0.1%
0.27327022252
 
< 0.1%
0.19896590792
 
< 0.1%
0.19834555822
 
< 0.1%
Other values (299647)299980
> 99.9%
ValueCountFrequency (%)
-0.074207775411
< 0.1%
-0.074205627871
< 0.1%
-0.074200310181
< 0.1%
-0.074198142191
< 0.1%
-0.074197180911
< 0.1%
-0.074194010751
< 0.1%
-0.0741932541
< 0.1%
-0.074192108651
< 0.1%
-0.07418944981
< 0.1%
-0.074185706961
< 0.1%
ValueCountFrequency (%)
0.97503549841
< 0.1%
0.97503241131
< 0.1%
0.9750311751
< 0.1%
0.97502921821
< 0.1%
0.97502834931
< 0.1%
0.97502579911
< 0.1%
0.97502565081
< 0.1%
0.97502511391
< 0.1%
0.97502450641
< 0.1%
0.97502437921
< 0.1%

cont13
Real number (ℝ≥0)

Distinct299705
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5082726174
Minimum0.1510501628
Maximum0.9059915785
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum0.1510501628
5-th percentile0.1947782564
Q10.3006690448
median0.4723998669
Q30.7584467611
95-th percentile0.8621688106
Maximum0.9059915785
Range0.7549414157
Interquartile range (IQR)0.4577777163

Descriptive statistics

Standard deviation0.2229499387
Coefficient of variation (CV)0.4386424354
Kurtosis-1.308751282
Mean0.5082726174
Median Absolute Deviation (MAD)0.1862629387
Skewness0.2433524031
Sum152481.7852
Variance0.04970667518
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.78927367133
 
< 0.1%
0.27402555122
 
< 0.1%
0.291580112
 
< 0.1%
0.45646543982
 
< 0.1%
0.26752257532
 
< 0.1%
0.47329368862
 
< 0.1%
0.16784709282
 
< 0.1%
0.16756845152
 
< 0.1%
0.86799278162
 
< 0.1%
0.48185867242
 
< 0.1%
Other values (299695)299979
> 99.9%
ValueCountFrequency (%)
0.15105016281
< 0.1%
0.15105079581
< 0.1%
0.15106055031
< 0.1%
0.15106139151
< 0.1%
0.15106616661
< 0.1%
0.15106738841
< 0.1%
0.15106827371
< 0.1%
0.15106899881
< 0.1%
0.15107242391
< 0.1%
0.15107709881
< 0.1%
ValueCountFrequency (%)
0.90599157851
< 0.1%
0.90599102291
< 0.1%
0.90598935591
< 0.1%
0.90597564991
< 0.1%
0.90597213071
< 0.1%
0.90596167921
< 0.1%
0.90595163781
< 0.1%
0.90595011641
< 0.1%
0.90594921671
< 0.1%
0.90594650461
< 0.1%

target
Real number (ℝ≥0)

Distinct299613
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.241978552
Minimum0.1403287728
Maximum10.41199175
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 MiB

Quantile statistics

Minimum0.1403287728
5-th percentile7.063491334
Q17.742071304
median8.191373347
Q38.728634343
95-th percentile9.556069375
Maximum10.41199175
Range10.27166298
Interquartile range (IQR)0.986563039

Descriptive statistics

Standard deviation0.7465548095
Coefficient of variation (CV)0.09057956227
Kurtosis0.7132564609
Mean8.241978552
Median Absolute Deviation (MAD)0.4921596385
Skewness0.1764866213
Sum2472593.566
Variance0.5573440836
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6.9563050392
 
< 0.1%
9.107261822
 
< 0.1%
8.0530843012
 
< 0.1%
8.4271426282
 
< 0.1%
8.5228807992
 
< 0.1%
8.5185596762
 
< 0.1%
8.5032607812
 
< 0.1%
8.3981197462
 
< 0.1%
7.0156332882
 
< 0.1%
8.2367331442
 
< 0.1%
Other values (299603)299980
> 99.9%
ValueCountFrequency (%)
0.14032877281
< 0.1%
0.48331443631
< 0.1%
0.57439530591
< 0.1%
0.5920953971
< 0.1%
0.91175245951
< 0.1%
1.2964334741
< 0.1%
1.4368217391
< 0.1%
1.6648088421
< 0.1%
1.8135341171
< 0.1%
1.82222431
< 0.1%
ValueCountFrequency (%)
10.411991751
< 0.1%
10.411898641
< 0.1%
10.411896141
< 0.1%
10.411889881
< 0.1%
10.41188521
< 0.1%
10.411854441
< 0.1%
10.411834021
< 0.1%
10.41182431
< 0.1%
10.411823951
< 0.1%
10.411820071
< 0.1%

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

idcat0cat1cat2cat3cat4cat5cat6cat7cat8cat9cont0cont1cont2cont3cont4cont5cont6cont7cont8cont9cont10cont11cont12cont13target
01BBBCBBAECN0.201470-0.0148220.6696990.1362780.6107060.4003610.1602660.3109210.3894700.2675590.2372810.3778730.3224010.8698508.113634
12BBAABDAFAO0.7430680.3674111.0216050.3657980.2768530.5330870.5589220.5162940.5949280.3414390.9060130.9217010.2619750.4650838.481233
23AAACBDADAF0.7427080.310383-0.0126730.5769570.2850740.6506090.3753480.9025670.5552050.8435310.7488090.6201260.5414740.7638468.364351
34BBACBDAECK0.4295510.6209980.5779420.2806100.2846670.6689800.2390610.7329480.6796180.5748440.3460100.7146100.5401500.2806828.049253
46AAACBDAEAN1.0582910.367492-0.0523890.2324070.2875950.6869640.4206670.6481820.6845010.9566921.0007730.7767420.6258490.2508237.972260
57ABACBDAEGF0.4020560.8370580.7379910.7784290.5302500.3924320.6581690.9974730.5698740.9608640.2380500.3160650.7317290.6947198.028558
68BAAABDAECF0.7436610.2347940.3390260.4240340.2815110.3967050.2734540.8245730.6563250.6771140.8084450.6159730.6316770.2835617.811465
79AAACBBAEAM0.8879590.4827990.6745880.5848110.7630810.6333530.3397600.8020061.0109970.3912210.0572970.5911200.0746290.7758697.674188
810ABACBDAEGI0.5234720.4920590.1654400.7499950.2811100.4725640.4140360.8091421.0133010.7611831.0417110.3939600.7823810.8656108.090095
911AAAABBAEEM0.5030130.5511630.6228710.4715600.2871090.4257160.2337050.4930360.3530480.3346750.0850870.2306340.6367320.2918748.446155

Last rows

idcat0cat1cat2cat3cat4cat5cat6cat7cat8cat9cont0cont1cont2cont3cont4cont5cont6cont7cont8cont9cont10cont11cont12cont13target
299990499980BAACBDAEAF0.5368300.4916640.3604260.4216270.5905160.3477920.8094870.3478890.6083950.8513220.8801450.7917140.8541600.2659878.115353
299991499985AAACBBAEAF0.4107990.5610940.3438580.6029670.5982760.3376920.3225110.3845850.5933780.1197830.4319340.2803140.7255190.3301408.086082
299992499988AAAABDAEAF0.9655610.260241-0.0540030.7131360.2875251.0617840.6521960.7094360.9006160.7404600.9962711.0660310.9221700.2751977.907309
299993499989AAACBDAEEF0.9540270.4948320.5769180.4206670.5911781.0092180.5236280.6541641.0159280.5922010.1882110.1649090.7996940.4394687.975959
299994499992BAACBBAEEI0.4530600.3515240.7226740.5594650.4907400.5040400.6023270.3123371.0248690.9472770.5006640.6625300.6519540.6627029.466745
299995499993BBAABDAEAI0.6971240.4834520.2977700.1958210.3078830.7697920.4505380.9343601.0050770.8537260.4225411.0634630.6976850.5064047.945605
299996499996ABACBBAEEF0.4462000.7151350.6109310.6017300.7367130.5280560.5085020.3582470.2578250.4335250.3010150.2684470.5770550.8236117.326118
299997499997BBACBCAEGF0.5442790.0609370.5909550.9053080.2770740.6887470.3724250.3649360.3832240.5518250.6610070.6296060.7141390.2457328.706755
299998499998ABACBBAEEI0.3000620.6131180.2852130.4068510.8059630.3444040.4242430.3820280.4688190.3510360.2887680.6111690.3802540.3320307.229569
299999499999AAACADAEAO0.8577520.6285280.5396250.3961430.2767850.5653470.3286690.7891650.9604060.7760190.7347070.4843920.6397540.6893178.631146